A New Intonation Model for Text-to-speech Synthesis
Authors
Abstract
The text-to-speech intonation model we are developing draws on linguistics and on the acoustics and aerodynamics of speech production. Our underlying premise is that some of the physical processes intrinsic to human speech production can be cognitively represented, and can therefore become part of the domain of language processing. The model is based on our general philosophy of factoring out intrinsic and extrinsic physical phenomena to create associations between physical and cognitive representations. The model is easily extended to handle variability beyond the neutral rendering of intonation, using overlays to add pragmatically determined intentional and emotional effects.
Similar resources
Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One interesting topic in the multimedia domain is enabling computers to produce speech. Speech synthesis gives the computer the human ability to produce speech. The data-based approach and the process-based approach are the two main approaches to speech synthesis. Each approach has its own challenges. Unit-selection speech synthesis and statistical parametr...
Automatic synthesis of natural-sounding intonation for text-to-speech conversion in Dutch
A set of rules is proposed for the automatic synthesis of natural-sounding intonation as part of speech synthesis in Dutch from unrestricted text. Results of a formal perceptual evaluation show that the synthetic intonation is judged to be as natural as human intonation for isolated utterances; for texts, additional provisions are required to model contributions of text structure. It is suggest...
Structural Data-Driven Prosody Model for TTS Synthesis
This paper introduces a new data-driven prosody model for the text-to-speech system ARTIC. The model is intended to be almost language-independent and to generate naturally sounding intonation with a link to semantics. It is based on text parametrisation using a new prosodic grammar and on automatic speech corpora analysis methods. Its performance is evaluated by results of presented listening ...
Inventory of intonation contours for text-to-speech synthesis
This paper presents an intonation model which determines intonation contours over intonation phrases. The model is described by four elements: communicative type of an intonation phrase; number of accent groups in it; position of the nuclear accent group in it; and set of target intonation points. Individualization of the model is based on semiautomatic analysis of speaker database. The model w...
A stochastic model of intonation for French text-to-speech synthesis
This paper presents a stochastic model of French intonation contours for use in text-to-speech synthesis. The model has two modules, a linguistic module that generates abstract prosodic labels from text, and a phonetic module that generates an F0 curve from the abstract prosodic labels. This model differs from previous work in the abstract prosodic labels used, which can be automatically derive...
Publication date: 1999